Abstract

We study the maximization version of the fundamental graph coloring problem. Here the goal is to
color the vertices of a k-colorable graph with k colors so that a maximum fraction of edges are properly
colored (i.e., their endpoints receive different colors). A random k-coloring properly colors an expected
fraction $1 - 1/k$ of edges. We prove that given a graph promised to be k-colorable, it is NP-hard to find a
k-coloring that properly colors more than a fraction $\approx 1 - \frac{1}{33k}$ of edges. Previously, only a hardness factor
of $1 - O(1/k^2)$ was known. Our result pins down the correct asymptotic dependence of the approximation
factor on k. Along the way, we prove that approximating the Maximum 3-colorable subgraph problem
within a factor greater than $32/33$ is NP-hard.

Using semidefinite programming, it is known that one can do better than a random coloring and
properly color a fraction $1 - 1/k + 2\ln k/k^2$ of edges in polynomial time. We show that, assuming the 2-to-1
conjecture, it is hard to properly color (using k colors) more than a fraction $1 - 1/k + O(\ln k/k^2)$ of edges
of a k-colorable graph.
1 Introduction



1.1 Problem statement 

A graph $G = (V, E)$ is said to be k-colorable for some positive integer k if there exists a k-coloring
$\chi : V \to \{1, 2, \ldots, k\}$ such that for all edges $(u, v) \in E$, $\chi(u) \neq \chi(v)$. For $k \geq 3$, finding a k-coloring of
a k-colorable graph is a classic NP-hard problem. The problem of coloring a graph with the fewest number
of colors has been extensively studied. In this paper, our focus is on hardness results for the following
maximization version of graph coloring: Given a k-colorable graph (for some fixed constant $k \geq 3$), find a
k-coloring that maximizes the fraction of properly colored edges. (We say an edge is properly colored under
a coloring if its endpoints receive distinct colors.) Note that for k = 2 the problem is trivial: one can find
a proper 2-coloring in polynomial time when the graph is bipartite (2-colorable).

We will call this problem Max k-Colorable Subgraph. The problem is equivalent to partitioning
the vertices into k parts so that a maximum number of edges are cut. This problem is more popularly
referred to as Max k-Cut in the literature; however, in the Max k-Cut problem the input is an arbitrary
graph that need not be k-colorable. To highlight that our focus is on the case when the
input graph is k-colorable, we use Max k-Colorable Subgraph to refer to this variant. We stress that we
will use this convention throughout the paper: Max k-Colorable Subgraph always refers to the "perfect
completeness" case, when the input graph is k-colorable.¹ Since our focus is on hardness results, we note
that this restriction only makes our results stronger.

A factor $\alpha = \alpha_k$ approximation algorithm for Max k-Colorable Subgraph is an efficient algorithm
that, given as input a k-colorable graph, outputs a k-coloring that properly colors at least a fraction $\alpha$ of
the edges. We say that Max k-Colorable Subgraph is NP-hard to approximate within a factor $\beta$ if no
factor $\beta$ approximation algorithm exists for the problem unless P = NP. The goal is to determine the
approximation threshold of Max k-Colorable Subgraph: the largest $\alpha$ as a function of k for which a factor
$\alpha$ approximation algorithm for Max k-Colorable Subgraph exists.

1.2 Previous results 

The algorithm which simply picks a random k-coloring, without even looking at the graph, properly colors
an expected fraction $1 - 1/k$ of edges. Frieze and Jerrum [3] used semidefinite programming to give a
polynomial time factor $1 - 1/k + 2\ln k/k^2$ approximation algorithm for Max k-Cut, which in particular
means the algorithm will properly color at least this fraction of edges in a k-colorable graph. This remains the
best known approximation guarantee for Max k-Colorable Subgraph to date. Khot, Kindler, Mossel, and
O'Donnell [7] showed that obtaining an approximation factor of $1 - 1/k + 2\ln k/k^2 + \Omega(\ln\ln k/k^2)$ for Max
k-Cut is Unique Games-hard, thus showing that the Frieze-Jerrum algorithm is essentially the best possible.
However, due to the "imperfect completeness" inherent to the Unique Games conjecture, this hardness result
does not hold for Max k-Colorable Subgraph when the input is required to be k-colorable.
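The $1 - 1/k$ baseline of the random coloring can be checked exactly on a small graph. The sketch below is only an illustration: the function name `expected_random_fraction` and the triangle example are assumptions, not part of the paper. It enumerates all $k^n$ colorings and averages the fraction of properly colored edges.

```python
from fractions import Fraction
from itertools import product


def expected_random_fraction(num_vertices, edges, k):
    """Exact expected fraction of properly colored edges under a
    uniformly random k-coloring, computed by brute-force enumeration."""
    total = Fraction(0)
    count = 0
    for coloring in product(range(k), repeat=num_vertices):
        proper = sum(1 for u, v in edges if coloring[u] != coloring[v])
        total += Fraction(proper, len(edges))
        count += 1
    return total / count


# Hypothetical example graph: a triangle on vertices 0, 1, 2.
triangle = [(0, 1), (1, 2), (0, 2)]
baseline_k3 = expected_random_fraction(3, triangle, 3)
baseline_k4 = expected_random_fraction(3, triangle, 4)
```

By linearity of expectation each edge is properly colored with probability $1 - 1/k$, so the result does not depend on the graph chosen.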

For Max k-Colorable Subgraph, the best hardness known prior to our work was a factor $1 - O(1/k^2)$.
This is obtained by combining an inapproximability result for Max 3-Colorable Subgraph due to
Petrank [11] with a reduction from Papadimitriou and Yannakakis [10]. It is a natural question whether there is
an efficient algorithm that could properly color a fraction $1 - 1/k^{1+\epsilon}$ of edges given a k-colorable graph
for some absolute constant $\epsilon > 0$. The existing hardness results do not rule out the possibility of such an
algorithm.

¹ While a little non-standard, this makes our terminology more crisp, as we can avoid repeating the fact that the hardness holds
for k-colorable graphs in our statements.
For Max k-Cut, a better hardness factor was shown by Kann, Khanna, Lagergren, and Panconesi [5]:
for some absolute constants $\beta > \alpha > 0$, they showed that it is NP-hard to distinguish graphs that have a
k-cut in which a fraction $(1 - \alpha/k)$ of the edges cross the cut from graphs whose Max k-Cut value is at most
a fraction $(1 - \beta/k)$ of edges. Since MaxCut is easy when the graph is 2-colorable, this reduction does not
yield any hardness for Max k-Colorable Subgraph.

1.3 Our results 

Petrank [11] showed the existence of a $\gamma_0 > 0$ such that it is NP-hard to find a 3-coloring that properly
colors more than a fraction $(1 - \gamma_0)$ of the edges of a 3-colorable graph. The value of $\gamma_0$ in [11] was left
unspecified and would be very small if calculated. The reduction in [11] was rather complicated, involving
expander graphs and starting from the weak hardness bounds for bounded occurrence satisfiability. We
prove that the NP-hardness holds with $\gamma_0 = 1/33$. In other words, it is NP-hard to obtain an approximation
ratio bigger than $32/33$ for Max 3-Colorable Subgraph. The reduction is from the constraint satisfaction
problem corresponding to the adaptive 3-query PCP with perfect completeness from [4].

By a reduction from Max 3-Colorable Subgraph, we prove that for every $k \geq 3$, Max k-Colorable
Subgraph is NP-hard to approximate within a factor greater than $\approx 1 - \frac{1}{33k}$ (Theorem 4). This identifies
the correct asymptotic dependence on k of the best possible approximation factor for Max k-Colorable
Subgraph. The reduction is similar to the one in [5], though some crucial changes have to be made in
the construction and some new difficulties overcome in the soundness analysis when reducing from Max
3-Colorable Subgraph instead of MaxCut.

In the quest for pinning down the exact approximability of Max k-Colorable Subgraph, we prove
the following conditional result. Assuming the so-called 2-to-1 conjecture, it is hard to approximate Max
k-Colorable Subgraph within a factor $1 - 1/k + O(\ln k/k^2)$. In other words, the Frieze-Jerrum algorithm is
optimal up to lower order terms in the approximation ratio even for instances of Max k-Cut where the graph
is k-colorable.

Unlike the Unique Games Conjecture (UGC), the 2-to-1 conjecture allows perfect completeness, i.e.,
the hardness holds even for instances where an assignment satisfying all constraints exists. The 2-to-1
conjecture was used by Dinur, Mossel, and Regev [2] to prove that for every constant c, it is NP-hard to
color a 4-colorable graph with c colors. We analyze a similar reduction for the k-coloring case when the
objective is to maximize the fraction of edges that are properly colored by a k-coloring. Our analysis uses
some of the machinery developed in [2], which in turn extends the invariance principle of [8]. The hardness
factor we obtain depends on the spectral gap of a certain $k^2 \times k^2$ stochastic matrix.

Remark 1. In general it is far from clear which Unique Games-hardness results can be extended to hold
with perfect completeness by assuming, say, the 2-to-1 (or some related) conjecture. In this vein, we also
mention the result of O'Donnell and Wu [9], who showed a tight hardness for approximating satisfiable
constraint satisfaction problems on 3 Boolean variables assuming the d-to-1 conjecture for any fixed d.
While the UGC assumption has led to a nearly complete understanding of the approximability of constraint
satisfaction problems [12], the approximability of satisfiable constraint satisfaction problems remains a
mystery to understand in any generality.

Remark 2. It has been shown by Crescenzi, Silvestri and Trevisan [1] that any hardness result for weighted
instances of Max k-Cut carries over to unweighted instances assuming the total edge weight is polynomially
bounded. In fact, their reduction preserves k-colorability, so an inapproximability result for the weighted
Max k-Colorable Subgraph problem also holds for the unweighted version. Therefore all our hardness
results hold for the unweighted Max k-Colorable Subgraph problem.
2 Unconditional Hardness Results for Max k-Colorable Subgraph

We will first prove a hardness result for Max 3-Colorable Subgraph, and then reduce this problem to Max
k-Colorable Subgraph.

2.1 Inapproximability result for Max 3-Colorable Subgraph 

Petrank [11] showed that Max 3-Colorable Subgraph is NP-hard to approximate within a factor of $(1 - \gamma_0)$
for some constant $\gamma_0 > 0$. This constant $\gamma_0$ is presumably very small, since the reduction starts from
bounded occurrence satisfiability (for which only weak inapproximability results are known) and uses
expander graphs. We prove a much better inapproximability factor below, via a simpler proof.

Theorem 1 (Max 3-Colorable Subgraph Hardness). The Max 3-Colorable Subgraph problem is NP-hard
to approximate within a factor of $32/33 + \epsilon$ for any constant $\epsilon > 0$.

Proof. For the proof of this theorem, we will reduce from a hard-to-approximate constraint satisfaction
problem (CSP) underlying the adaptive 3-query PCP given in [4]. This PCP has perfect completeness and
soundness $1/2 + \epsilon$ for any desired constant $\epsilon > 0$ (which is the best possible for 3-query PCPs).

We first state the properties of the CSP. An instance of the CSP will have variables partitioned into three
parts $\mathcal{X}$, $\mathcal{Y}$ and $\mathcal{Z}$. Each constraint will be of the form $(x_i \vee (Y_j = z_k)) \wedge (\bar{x}_i \vee (Y_j = z_l))$, where $x_i \in \mathcal{X}$ and
$z_k, z_l \in \mathcal{Z}$ are variables (unnegated) and $Y_j$ is a literal ($Y_j \in \{y_j, \bar{y}_j\}$ for some variable $y_j \in \mathcal{Y}$). For YES
instances of the CSP, there will be a Boolean assignment that satisfies all the constraints. For NO instances,
every assignment to the variables will satisfy at most a fraction $(1/2 + \epsilon)$ of the constraints.
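To make the constraint type concrete, the following sketch evaluates a single constraint $(x_i \vee (Y_j = z_k)) \wedge (\bar{x}_i \vee (Y_j = z_l))$ on Boolean inputs. The function name and the argument order are illustrative assumptions; `Y` stands for the value of the literal $Y_j$ after any negation has been applied.

```python
from itertools import product


def constraint_satisfied(x, Y, z_k, z_l):
    """Evaluate (x OR (Y == z_k)) AND ((NOT x) OR (Y == z_l)).

    All four arguments are Booleans; Y is the value of the literal Y_j.
    """
    return (x or (Y == z_k)) and ((not x) or (Y == z_l))


# Count how many of the 16 Boolean assignments satisfy one constraint.
num_satisfying = sum(
    constraint_satisfied(x, Y, z_k, z_l)
    for x, Y, z_k, z_l in product([False, True], repeat=4)
)
```

The variable $x_i$ acts as a selector: when it is true the constraint reduces to $Y_j = z_l$, and when it is false it reduces to $Y_j = z_k$. Exactly half of the 16 assignments satisfy a single constraint.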

Remark 3. We remark that the conditions that the instance is tripartite and that the variables in $\mathcal{Z}$ never appear
negated are not explicit in [4]. But these can be ensured by an easy modification to the PCP construction
in [4]. The PCP in [4] has a bipartite structure: the proof is partitioned into two parts called the A-tables
and B-tables, and each test consists of probing one bit $A(f)$ from an A-table and 3 bits $B(g), B(g_1), B(g_2)$
from the B-table, and checking $(A(f) \vee (B(g) = B(g_1))) \wedge (\overline{A(f)} \vee (B(g) = B(g_2)))$. Further, these tables
are folded, which is a technical condition that corresponds to the occurrence of negations in the CSP world.
If the queries at locations $g_1$ and $g_2$ are made in a parallel C-table, and even if the C-table is not folded
(though the A and B tables need to be folded), one can verify that the analysis of the PCP construction still
goes through. This then translates to a CSP with the properties claimed above.

Let I be an instance of such a CSP with m constraints of the above form on variables $V = \mathcal{X} \cup \mathcal{Y} \cup \mathcal{Z}$.
Let $\mathcal{X} = \{x_1, x_2, \ldots, x_{n_1}\}$, $\mathcal{Y} = \{y_1, y_2, \ldots, y_{n_2}\}$ and $\mathcal{Z} = \{z_1, z_2, \ldots, z_{n_3}\}$. From the instance I we
create a graph G for the Max 3-Colorable Subgraph problem as follows. There is a node $x_i$ for each
variable $x_i \in \mathcal{X}$, a node $z_l$ for each $z_l \in \mathcal{Z}$, and a pair of nodes $\{y_j, \bar{y}_j\}$ for the two literals corresponding to
each $y_j \in \mathcal{Y}$. There are also three global nodes $\{R, T, F\}$ representing Boolean values which are connected
in a triangle with edge weights m/2 (see Fig. 1).

For each constraint of the CSP, we place the local gadget specific to that constraint shown in Figure 2.
Note that there are 10 edges of unit weight in this gadget. The nodes $y_j, \bar{y}_j$ are connected to node R by a
triangle whose edge weights equal $w_j = \frac{\Delta(y_j) + \Delta(\bar{y}_j)}{2}$. Here $\Delta(X)$ denotes the total number of edges going
from node X into all the local gadgets. The nodes $x_i$ and $z_l$ are connected to R with an edge of weight $\Delta(x_i)/2$
and $\Delta(z_l)/2$ respectively. The proofs of the following lemmas appear in Appendix A.
Figure 1: Global gadget for truth value assignments. Blocks $x_i$, $Y_j$ and $z_l$ are replicated for
all vertices in $\mathcal{X}$, $\mathcal{Y}$ and $\mathcal{Z}$. Edge weights are
shown next to each edge.

Figure 2: Local gadget for each constraint of the
form $(x_i \vee (Y_j = z_k)) \wedge (\bar{x}_i \vee (Y_j = z_l))$. All edges
have unit weight. Labels A, A', B, B' refer to the
local nodes in each gadget.



Lemma 2 (Completeness). Given an assignment of variables $\sigma : V \to \{0, 1\}$ which satisfies at least c of
the constraints, we can construct a 3-coloring of G with at most $m - c$ improperly colored edges (each of
weight 1).

Lemma 3 (Soundness). Given a 3-coloring $\chi$ of G such that the total weight of edges that are not properly
colored by $\chi$ is at most $\tau < m/2$, we can construct an assignment $\sigma' : V \to \{0, 1\}$ to the variables of the
CSP instance that satisfies at least $m - \tau$ constraints.

Returning to the proof of Theorem 1, the total weight of edges in G is

$$10m + \frac{3m}{2} + \sum_{i=1}^{n_1} \frac{\Delta(x_i)}{2} + \sum_{j=1}^{n_2} 3 w_j + \sum_{l=1}^{n_3} \frac{\Delta(z_l)}{2} = \frac{27}{2} m + \frac{3}{2} \sum_{j=1}^{n_2} \left(\Delta(y_j) + \Delta(\bar{y}_j)\right) = \frac{33}{2} m.$$

By the completeness lemma, YES instances of the CSP are mapped to graphs G that are 3-colorable. By
the soundness lemma, NO instances of the CSP are mapped to graphs G such that every 3-coloring miscolors
at least a fraction $\frac{(1/2 - \epsilon) m}{33m/2} = \frac{1 - 2\epsilon}{33}$ of the total weight of edges. Since $\epsilon > 0$ is an arbitrary constant, the proof
of Theorem 1 is complete.² □
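The weight bookkeeping behind the $32/33$ factor can be replayed with exact rational arithmetic. The sketch below rests on an assumption that is consistent with the totals in the text but not spelled out there: the edges from the $x$, $z$ and $y/\bar{y}$ nodes into the local gadgets sum to $2m$ in each of the three classes. The function names are illustrative.

```python
from fractions import Fraction


def total_weight(m):
    """Total edge weight of G for a CSP instance with m constraints.

    Assumption (not stated explicitly in the text): the gadget degrees
    sum to 2m over the x nodes, 2m over the z nodes, and 2m over the
    y / y-bar nodes.
    """
    gadget_edges = Fraction(10 * m)            # 10 unit edges per gadget
    global_triangle = 3 * Fraction(m, 2)       # triangle on R, T, F
    x_edges = Fraction(2 * m, 2)               # sum_i Delta(x_i) / 2
    z_edges = Fraction(2 * m, 2)               # sum_l Delta(z_l) / 2
    y_triangles = 3 * Fraction(2 * m, 2)       # sum_j 3 * w_j
    return gadget_edges + global_triangle + x_edges + z_edges + y_triangles


def hardness_ratio(eps):
    """Largest properly colored weight fraction on NO instances."""
    m = 1
    miscolored = (Fraction(1, 2) - eps) * m    # from the soundness lemma
    return 1 - miscolored / total_weight(m)
```

With `eps` tending to 0 the ratio tends to $32/33$, matching the statement of Theorem 1.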



2.2 Max k-Colorable Subgraph Hardness

Theorem 4. For every integer k ^ 3 and every e > 0, it is NP-hard to approximate Max /c-Colorable 
Subgraph within a factor ofl — + e where = k mod 3^2. 

Proof. We will reduce Max 3-Colorable Subgraph to Max k-Colorable Subgraph and then apply
Theorem 1. Throughout the proof, we will assume k is divisible by 3. At the end, we will cover the remaining
cases also. The reduction is inspired by the reduction from MaxCut to Max k-Cut given by Kann et al. [5]
(see Remark 4). Some modifications to the reduction are needed when we reduce from Max 3-Colorable
Subgraph, and the analysis has to handle some new difficulties. The details of the reduction and its analysis
follow.

² Our reduction produced a graph with edge weights, but by Remark 2 the same inapproximability factor holds for unweighted
graphs as well.

Let $G = (V, E)$ be an instance of Max 3-Colorable Subgraph. By Theorem 1, it is NP-hard to tell if G
is 3-colorable or every 3-coloring miscolors a fraction $\frac{1}{33} - \epsilon$ of edges. We will construct a graph H such that
H is k-colorable when G is 3-colorable, and a k-coloring which miscolors at most a fraction $\mu$ of the total
weight of edges of H implies a 3-coloring of G with at most a fraction $\mu k$ of miscolored edges. Combined
with Theorem 1, this gives us the claimed hardness of Max k-Colorable Subgraph.

Let $K'_{k/3}$ denote the complete graph with loops on k/3 vertices. Let G' be the tensor product graph
between $K'_{k/3}$ and G, $G' = K'_{k/3} \otimes G$, as defined by Weichsel [13]. Identify each node in G' with $(u, i)$, $u \in V(G)$,
$i \in \{1, 2, \ldots, k/3\}$. The edges of G' are $((u, i), (v, i'))$ for $(u, v) \in E$ and any $i, i' \in \{1, \ldots, k/3\}$.
Next we make 3 copies of G', and identify the nodes with $(u, i, j)$, $(u, i) \in V(G')$, $j \in \{1, 2, 3\}$, then put
edges between all nodes of the form $(u, i, j)$ and $(u, i', j')$ if either $i \neq i'$ or $j \neq j'$, with weight $\frac{2}{3} d_u$, where
$d_u$ is the degree of node u. Writing $m = |E|$, the total weight of edges in this new construction H equals

$$\sum_{u \in V(G)} \binom{k}{2} \frac{2}{3} d_u + 3 \left(\frac{k}{3}\right)^2 m = \left(k^2 - \frac{2k}{3}\right) m \leq k^2 m.$$

Lemma 5. If G is 3-colorable, then H is k-colorable.

Proof. Let $\chi_G : V(G) \to \{1, 2, 3\}$ be a 3-coloring of G. Consider the following coloring function for H,
$\chi_H : V(H) \to \{1, 2, \ldots, k\}$. For node $(u, i, j)$, let $\chi_H((u, i, j)) = \pi^j(\chi_G(u)) + 3(i - 1)$. Here $\pi$ is the
permutation $\begin{pmatrix} 1 & 2 & 3 \\ 2 & 3 & 1 \end{pmatrix}$ and $\pi^j(x) = \pi(\ldots(\pi(x)))$ with $\pi$ applied j times. Equivalently, $\pi(x) = x \bmod 3 + 1$.

Consider edges of the form $\{(u, i, j), (v, i', j)\}$. If $i \neq i'$, then the colors of the endpoints are different. Else
we have $\chi_H((u, i, j)) - \chi_H((v, i, j)) \equiv \chi_G(u) - \chi_G(v) \not\equiv 0 \pmod 3$. For edges of the form $\{(u, i, j), (u, i', j')\}$, if
$i \neq i'$, clearly the edge is properly colored. When $i = i'$, $\chi_H((u, i, j)) - \chi_H((u, i, j')) = \pi^j(\chi_G(u)) - \pi^{j'}(\chi_G(u)) \equiv j - j' \not\equiv 0 \pmod 3$. □
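The completeness argument of Lemma 5 can be checked mechanically on a small instance. The sketch below is an illustration under stated assumptions: it builds only the edge set of H (weights are omitted since they do not affect properness), the helper names are hypothetical, and the test graph is a triangle.

```python
from itertools import combinations


def pi_power(color, j):
    """Apply pi(x) = x mod 3 + 1 to a color in {1, 2, 3} exactly j times."""
    for _ in range(j):
        color = color % 3 + 1
    return color


def build_h_edges(vertices, edges, k):
    """Edge list of H (weights omitted) for k divisible by 3."""
    blocks = range(1, k // 3 + 1)
    layers = (1, 2, 3)
    h_edges = []
    # Three copies of G' = K'_{k/3} (tensor) G: for every edge uv of G,
    # join (u, i, j) and (v, i', j) for all i, i' within the same copy j.
    for u, v in edges:
        for j in layers:
            for i in blocks:
                for i2 in blocks:
                    h_edges.append(((u, i, j), (v, i2, j)))
    # A clique on the k copies (u, i, j) of each vertex u.
    for u in vertices:
        copies = [(u, i, j) for i in blocks for j in layers]
        h_edges.extend(combinations(copies, 2))
    return h_edges


def lift_coloring(chi_g, k):
    """chi_H((u, i, j)) = pi^j(chi_G(u)) + 3 (i - 1), as in Lemma 5."""
    return {
        (u, i, j): pi_power(c, j) + 3 * (i - 1)
        for u, c in chi_g.items()
        for i in range(1, k // 3 + 1)
        for j in (1, 2, 3)
    }


# Hypothetical example: G is a triangle with a proper 3-coloring, k = 6.
vertices = [0, 1, 2]
edges = [(0, 1), (1, 2), (0, 2)]
h_edges = build_h_edges(vertices, edges, 6)
chi_h = lift_coloring({0: 1, 1: 2, 2: 3}, 6)
is_proper = all(chi_h[a] != chi_h[b] for a, b in h_edges)
```

The block index i selects a disjoint band of three colors, and within a band the three copies j are cyclic shifts of the original coloring, which is why both edge types end up properly colored.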

Lemma 6. If H has a k-coloring that properly colors a set of edges with at least a fraction $(1 - \mu)$ of the
total weight, then G has a 3-coloring which colors at least a fraction $(1 - \mu k)$ of its edges properly.

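Proof. Let $\chi_H$ be the coloring of H, $\mathrm{Sugg}_u^j = \{\chi_H((u, i, j)) \mid 1 \leq i \leq k/3\}$ and $\mathrm{Sugg}_u = \bigcup_j \mathrm{Sugg}_u^j$.
Denote the total weight of uncut edges in this solution as

$$C^{total} = \sum_{u \in V(G)} \frac{2}{3} d_u C_u^{within} + C^{between}, \qquad (1)$$

where $C_u^{within}$ and $C^{between}$ denote the number of improperly colored edges within the copies of node u
and between copies of different vertices $u, v \in V(G)$ respectively. We have the following relations:

$$C^{between} = \sum_{j=1}^{3} \sum_{uv \in E(G)} \sum_{1 \leq i, i' \leq k/3} 1_{\chi_H((u,i,j)) = \chi_H((v,i',j))} \geq \sum_{j=1}^{3} \sum_{uv \in E(G)} |\mathrm{Sugg}_u^j \cap \mathrm{Sugg}_v^j|, \qquad (2)$$

and, writing $S_u = \{(u, i, j) \mid \forall i, j\}$ and $B_{u,c} = \chi_H^{-1}(c) \cap S_u$ (so that $\sum_{c \in \mathrm{Sugg}_u} |B_{u,c}| = k$),

$$C_u^{within} = \sum_{c \in \mathrm{Sugg}_u} \binom{|B_{u,c}|}{2} = \sum_{c \in \mathrm{Sugg}_u} \frac{|B_{u,c}|^2 - |B_{u,c}|}{2} \geq \frac{1}{2}\left(\frac{k^2}{|\mathrm{Sugg}_u|} - k\right) \quad \text{(Cauchy-Schwarz)}$$

$$= \frac{k}{2} \cdot \frac{k - |\mathrm{Sugg}_u|}{|\mathrm{Sugg}_u|} \geq \frac{k - |\mathrm{Sugg}_u|}{2}. \qquad (3)$$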



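Now we will find a (random) 3-coloring $\chi_G$ for G. Pick c from $\{1, 2, \ldots, k\}$ uniformly at random. If
$c \notin \mathrm{Sugg}_u$, select $\chi_G(u)$ uniformly at random from $\{1, 2, 3\}$. If $c \in \mathrm{Sugg}_u$, set $\chi_G(u) = j$ if j is the smallest
index for which $c \in \mathrm{Sugg}_u^j$. With this coloring $\chi_G$, the probability that an edge $(u, v) \in E(G)$ will
be improperly colored is:

$$\Pr[\chi_G(u) = \chi_G(v)] \leq \sum_{j=1}^{3} \Pr_c[c \in \mathrm{Sugg}_u^j \cap \mathrm{Sugg}_v^j] + \frac{1}{3} \Pr_c[c \notin \mathrm{Sugg}_u, c \notin \mathrm{Sugg}_v] + \frac{1}{3} \Pr_c[c \in \mathrm{Sugg}_u, c \notin \mathrm{Sugg}_v] + \frac{1}{3} \Pr_c[c \notin \mathrm{Sugg}_u, c \in \mathrm{Sugg}_v]$$

$$\leq \sum_{j=1}^{3} \frac{|\mathrm{Sugg}_u^j \cap \mathrm{Sugg}_v^j|}{k} + \frac{k - |\mathrm{Sugg}_u|}{3k} + \frac{k - |\mathrm{Sugg}_v|}{3k}.$$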
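The rounding step can be sketched in code as follows. This is an illustrative sketch, not the authors' implementation: the function name, the dictionary representation of $\chi_H$ keyed by triples $(u, i, j)$, and the optional `rng` parameter are all assumptions made for the example.

```python
import random


def decode_coloring(chi_h, vertices, k, rng=random):
    """Randomized rounding of a k-coloring chi_h of H to a 3-coloring of G.

    chi_h maps triples (u, i, j), 1 <= i <= k/3, 1 <= j <= 3, to colors
    in {1, ..., k}. A single random color c is shared by all vertices.
    """
    c = rng.randint(1, k)
    chi_g = {}
    for u in vertices:
        chosen = None
        for j in (1, 2, 3):
            sugg_uj = {chi_h[(u, i, j)] for i in range(1, k // 3 + 1)}
            if c in sugg_uj:
                chosen = j          # smallest j with c in Sugg_u^j
                break
        if chosen is None:          # c is not in Sugg_u: pick at random
            chosen = rng.randint(1, 3)
        chi_g[u] = chosen
    return chi_g
```

When $\chi_H$ is the lifted coloring from Lemma 5, the sets $\mathrm{Sugg}_u^1, \mathrm{Sugg}_u^2, \mathrm{Sugg}_u^3$ partition the colors and the decoded coloring is again proper, whatever value c takes.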



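We can thus bound the expected number of miscolored edges in the coloring $\chi_G$ as follows:

$$E\left[\sum_{(u,v) \in E(G)} 1_{\chi_G(u) = \chi_G(v)}\right] \leq \sum_{uv \in E} \left( \sum_{j=1}^{3} \frac{|\mathrm{Sugg}_u^j \cap \mathrm{Sugg}_v^j|}{k} + \frac{k - |\mathrm{Sugg}_u|}{3k} + \frac{k - |\mathrm{Sugg}_v|}{3k} \right)$$

$$\leq \frac{1}{k} \left( C^{between} + \sum_{u \in V(G)} \frac{d_u}{3} \left(k - |\mathrm{Sugg}_u|\right) \right) \quad \text{(using (2))}$$

$$\leq \frac{1}{k} \left( C^{between} + \sum_{u \in V(G)} \frac{2}{3} d_u C_u^{within} \right) = \frac{C^{total}}{k} \quad \text{(using (3))}.$$

This implies that there exists a 3-coloring of G for which the number of improperly colored edges in G is at
most $\frac{C^{total}}{k}$. Therefore if H has a k-coloring which improperly colors at most a total weight $\mu k^2 m$ of edges,
then there is a 3-coloring of G which colors improperly at most a fraction $\frac{\mu k^2 m}{km} = \mu k$ of its edges. □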



□ 



This completes the proof of Theorem 4 when k is divisible by 3. The other cases are easily handled by
adding k mod 3 extra nodes connected to all vertices by edges of suitable weight. See Appendix O for
details. □

Remark 4 (Comparison to [5]). The reduction of Kann et al. [5] converts an instance G of MaxCut to the
instance $G' = K'_{k/2} \otimes G$ of Max k-Cut. Edge weights are picked so that the optimal k-cut of G' will give a
set $S_u$ of k/2 different colors to all vertices in each k/2 clique $(u, i)$, $1 \leq i \leq k/2$. This enables converting
a k-cut of G' into a cut of G based on whether a random color falls in $S_u$ or not. In the 3-coloring case, we
make 3 copies of G' in an attempt to enforce three "translates" of $S_u$, and use those to define a 3-coloring
from a k-coloring. But we cannot ensure that each k-clique is properly colored, so these translates might
overlap and a more careful soundness analysis is needed.
3 Conditional Hardness Results for Max k-Colorable Subgraph

We will first review the (exact) 2-to-1 Conjecture, and then construct a noise operator, which allows us to
preserve k-colorability. Then we will bound the stability of coloring functions with respect to this noise
operator. In the last section, we will give a PCP verifier which concludes the hardness result.

3.1 Preliminaries 

We begin by reviewing some definitions and the d-to-1 conjecture.

Definition 1. An instance of a bipartite Label Cover problem represented as $\mathcal{L} = (U, V, E, W, R_U, R_V, \Pi)$
consists of a weighted bipartite graph over node sets U and V with edges $e = (u, v) \in E$ of non-negative
real weight $w_e \in W$. $R_U$ and $R_V$ are integers with $1 \leq R_U \leq R_V$. $\Pi$ is a collection of projection
functions for each edge: $\Pi = \{\pi_{vu} : \{1, \ldots, R_V\} \to \{1, \ldots, R_U\} \mid u \in U, v \in V\}$. A labeling $\ell$ is a
mapping $\ell : U \to \{1, \ldots, R_U\}$, $\ell : V \to \{1, \ldots, R_V\}$. An edge $e = (u, v)$ is satisfied by labeling $\ell$ if
$\pi_e(\ell(v)) = \ell(u)$. We define the value of a labeling as the sum of weights of edges satisfied by this labeling
normalized by the total weight. $\mathrm{Opt}(\mathcal{L})$ is the maximum value over any labeling.

Definition 2. A projection $\pi : \{1, \ldots, R_V\} \to \{1, \ldots, R_U\}$ is called d-to-1 if for each $i \in \{1, \ldots, R_U\}$,
$|\pi^{-1}(i)| \leq d$. It is called exactly d-to-1 if $|\pi^{-1}(i)| = d$ for each $i \in \{1, 2, \ldots, R_U\}$.
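As a small illustration of Definition 2, the sketch below checks the d-to-1 and exactly d-to-1 conditions for a projection given as a dictionary from $\{1, \ldots, R_V\}$ to $\{1, \ldots, R_U\}$. The representation and the function names are assumptions made for the example.

```python
from collections import Counter


def preimage_sizes(projection, num_u_labels):
    """Sizes |pi^{-1}(i)| for i = 1, ..., R_U."""
    counts = Counter(projection.values())
    return [counts.get(i, 0) for i in range(1, num_u_labels + 1)]


def is_d_to_1(projection, num_u_labels, d):
    """Every label in {1, ..., R_U} has at most d preimages."""
    return all(size <= d for size in preimage_sizes(projection, num_u_labels))


def is_exactly_d_to_1(projection, num_u_labels, d):
    """Every label in {1, ..., R_U} has exactly d preimages."""
    return all(size == d for size in preimage_sizes(projection, num_u_labels))


# Hypothetical projections from {1, 2, 3, 4} to {1, 2}.
balanced = {1: 1, 2: 1, 3: 2, 4: 2}
skewed = {1: 1, 2: 1, 3: 1, 4: 2}
```

An exactly d-to-1 projection forces $R_V = d R_U$, which is the situation used by the verifier of Section 3.4 with $d = 2$.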

Definition 3. A bipartite Label-Cover instance $\mathcal{L}$ is called d-to-1 Label-Cover if all projection functions
$\pi \in \Pi$ are d-to-1.

Conjecture 1 (d-to-1 Conjecture [6]). For any $\gamma > 0$, there exists a d-to-1 Label-Cover instance $\mathcal{L}$ with
$R_U = R(\gamma)$ and $R_V \leq d R_U$ many labels such that it is NP-hard to decide between two cases, $\mathrm{Opt}(\mathcal{L}) = 1$
or $\mathrm{Opt}(\mathcal{L}) \leq \gamma$. Note that although the original conjecture involves d-to-1 projection functions, we will
assume that it also holds for exactly d-to-1 functions (so $R_V = d R_U$), which is the case in

Using the reductions from [2], it is possible to show that the above conjecture still holds given that the
graph $(U \cup V, E)$ is left-regular and unweighted, i.e., $w_e = 1$ for all $e \in E$.

3.2 Noise Operators 

For a positive integer M, we will denote by [M] the set $\{0, 1, \ldots, M - 1\}$. We will identify elements of
$[M^2]$ with $[M] \times [M]$ in the obvious way, with the pair $(a, b) \in [M]^2$ corresponding to $a + Mb \in [M^2]$.

Definition 4. A Markov operator T is a linear operator which maps probability measures to other probability
measures. In a finite discrete setting, it is defined by a stochastic matrix whose (x, y)'th entry $T(x \to y)$
is the probability of transitioning from x to y. Such an operator is called symmetric if $T(x \to y) = T(y \to x) = T(x \leftrightarrow y)$.

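Definition 5. Given $\rho \in [-1, 1]$, the Beckner noise operator $T_\rho$ on [q] is defined by $T_\rho(x \to x) = \frac{1}{q} + \left(1 - \frac{1}{q}\right)\rho$
and $T_\rho(x \to y) = \frac{1}{q}(1 - \rho)$ for any $x \neq y$.

Observation 1. All eigenvalues of the operator $T_\rho$ are given by $1 = \lambda_0(T_\rho) \geq \lambda_1(T_\rho) = \ldots = \lambda_{q-1}(T_\rho) = \rho$.
Any orthonormal basis $\alpha_0, \alpha_1, \ldots, \alpha_{q-1}$ with $\alpha_0$ being the constant vector is also an eigenbasis for $T_\rho$.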
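Observation 1 can be checked on a small case with exact arithmetic. The sketch below is illustrative only: the function names and the choice $q = 4$, $\rho = 1/3$ are assumptions. It builds the matrix of $T_\rho$ and applies it to the constant vector and to a vector orthogonal to it.

```python
from fractions import Fraction


def beckner_matrix(q, rho):
    """Stochastic matrix of the Beckner operator T_rho on [q]."""
    rho = Fraction(rho)
    diag = Fraction(1, q) + (1 - Fraction(1, q)) * rho
    off = Fraction(1, q) * (1 - rho)
    return [[diag if x == y else off for y in range(q)] for x in range(q)]


def apply_operator(matrix, vector):
    """Matrix-vector product with exact rational entries."""
    return [sum(row[y] * vector[y] for y in range(len(vector))) for row in matrix]


# Hypothetical parameters chosen for the example.
q, rho = 4, Fraction(1, 3)
T = beckner_matrix(q, rho)
constant_image = apply_operator(T, [1, 1, 1, 1])
orthogonal_image = apply_operator(T, [1, -1, 0, 0])
```

The constant vector is fixed (eigenvalue 1), while any vector summing to zero is scaled by $\rho$, since the diagonal entry exceeds the off-diagonal entry by exactly $\rho$.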
Lemma 7. For an integer $q \geq 6$, there exists a symmetric Markov operator T on $[q]^2$ whose diagonal
entries are all 0 and with eigenvalues $1 = \lambda_0 \geq \lambda_1 \geq \ldots \geq \lambda_{q^2-1}$ such that the spectral radius $\rho(T) =
\max\{|\lambda_1|, |\lambda_{q^2-1}|\}$ is at most $\frac{4}{q-1}$.

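Proof. Consider the symmetric Markov operator T on $[q]^2$ such that, for $x = (x_1, x_2)$, $y = (y_1, y_2) \in [q]^2$,

$$T(x \leftrightarrow y) = \begin{cases} \alpha & \text{if } \{x_1, x_2\} \cap \{y_1, y_2\} = \emptyset \text{ and } x_1 \neq x_2, y_1 \neq y_2, \\ \beta & \text{if } x_1 \notin \{y_1, y_2\} \text{ and } x_1 = x_2, y_1 \neq y_2, \\ \beta & \text{if } y_1 \notin \{x_1, x_2\} \text{ and } x_1 \neq x_2, y_1 = y_2, \\ 0 & \text{else,} \end{cases}$$

where $\alpha = \frac{1}{(q-1)(q-3)}$ and $\beta = \frac{1}{(q-1)(q-2)}$. It is clear that T is symmetric and doubly stochastic.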
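The claimed structural properties of T can be verified by brute force for a small q. The sketch below is an illustration with hypothetical names; it stores T as a dictionary indexed by pairs of states in $[q]^2$ and uses exact rational arithmetic.

```python
from fractions import Fraction
from itertools import product


def lemma7_operator(q):
    """Transition probabilities T[x, y] of the operator from Lemma 7 on [q]^2."""
    alpha = Fraction(1, (q - 1) * (q - 3))
    beta = Fraction(1, (q - 1) * (q - 2))
    states = list(product(range(q), repeat=2))
    T = {}
    for x in states:
        for y in states:
            disjoint = not (set(x) & set(y))
            x_diag = x[0] == x[1]
            y_diag = y[0] == y[1]
            if disjoint and not x_diag and not y_diag:
                T[x, y] = alpha
            elif disjoint and x_diag != y_diag:
                T[x, y] = beta      # exactly one of x, y has equal coordinates
            else:
                T[x, y] = Fraction(0)
    return states, T


states, T = lemma7_operator(6)
```

Every transition with positive probability moves to a pair sharing no value with the current pair. This disjointness seems to be the feature that lets the noise operator preserve k-colorability, as announced at the start of Section 3.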

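To bound the spectral radius of T, we will bound the second largest eigenvalue $\lambda_1(T^2)$ of $T^2$. Notice
that $T^2$ is also a symmetric Markov operator. Moreover $\lambda_i(T^2) = \lambda_i^2(T)$, therefore $\lambda_1(T^2) =
\max(\lambda_1^2(T), \lambda_{q^2-1}^2(T)) = \rho(T)^2$.

Notice that $T^2(x \leftrightarrow y) > 0$ for all pairs $x, y \in [q]^2$. Consider the variational characterization of $1 - \lambda_1(T^2)$,
where $\pi$ denotes the uniform distribution on $[q]^2$:

$$1 - \lambda_1(T^2) = \min_{\psi} \frac{\sum_{x,y} (\psi(x) - \psi(y))^2 \pi(x) T^2(x \leftrightarrow y)}{\sum_{x,y} (\psi(x) - \psi(y))^2 \pi(x) \pi(y)} \geq \min_{\psi} \min_{x,y} \frac{\pi(x) (\psi(x) - \psi(y))^2 T^2(x \leftrightarrow y)}{(\psi(x) - \psi(y))^2 \pi(x) \pi(y)} = \min_{x,y} q^2 T^2(x \leftrightarrow y).$$

For any two pairs $(x_1, x_2), (y_1, y_2) \in [q]^2$, let $l = |\{x_1, x_2, y_1, y_2\}|$. Then we have

$$T^2((x_1, x_2) \leftrightarrow (y_1, y_2)) = \begin{cases} (q-l)(q-l-1)\beta^2 \geq (q-2)(q-3)\beta^2 & \text{if } x_1 = x_2 \text{ and } y_1 = y_2, \\ (q-l)(q-l-1)\alpha\beta \geq (q-3)(q-4)\alpha\beta & \text{if } x_1 \neq x_2 \text{ and } y_1 = y_2, \\ (q-l)(q-l-1)\alpha\beta \geq (q-3)(q-4)\alpha\beta & \text{if } x_1 = x_2 \text{ and } y_1 \neq y_2, \\ (q-l)(q-l-1)\alpha^2 + (q-l)\beta^2 \geq (q-4)(q-5)\alpha^2 + (q-4)\beta^2 & \text{if } x_1 \neq x_2 \text{ and } y_1 \neq y_2 \end{cases}$$

$$\geq \frac{(q-5)(q-4)}{(q-3)^2 (q-2)(q-1)}.$$

So $\rho(T) = \sqrt{\lambda_1(T^2)} \leq \sqrt{1 - \frac{q^2 (q-5)(q-4)}{(q-3)^2 (q-2)(q-1)}} \leq \frac{4}{q-1}$ for $q \geq 6$. □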



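3.3 q-ary Functions, Influences, Noise stability

We define the inner product on the space of functions from $[q]^N$ to $\mathbb{R}$ as $\langle f, g \rangle = E_{x \sim [q]^N}[f(x) g(x)]$. Here $x \sim \mathcal{D}$
denotes sampling from distribution $\mathcal{D}$, and $\mathcal{D} = [q]^N$ denotes the uniform distribution on $[q]^N$.

Given a symmetric Markov operator T and $x = (x_1, \ldots, x_N) \in [q]^N$, let $T^{\otimes N} x$ denote the product
distribution on $[q]^N$ whose i'th entry $y_i$ is distributed according to $T(x_i \leftrightarrow y_i)$. Therefore $T^{\otimes N} f(x) =
E_{y \sim T^{\otimes N} x}[f(y)]$.

Definition 6. Let $\alpha_0, \alpha_1, \ldots, \alpha_{q-1}$ be an orthonormal basis of $\mathbb{R}^q$ such that $\alpha_0$ is the all constant vector. For
$x \in [q]^N$, we define $\alpha_x \in \mathbb{R}^{q^N}$ as

$$\alpha_x = \alpha_{x_1} \otimes \alpha_{x_2} \otimes \cdots \otimes \alpha_{x_N}.$$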
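Definition 7 (Fourier coefficients). For a function $f : [q]^N \to \mathbb{R}$, define $\hat{f}(\alpha_x) = \langle f, \alpha_x \rangle$.

Definition 8. Let $f : [q]^N \to \mathbb{R}$ be a function. The influence of the i'th variable on f, $\mathrm{Inf}_i(f)$, is defined by

$$\mathrm{Inf}_i(f) = E[\mathrm{Var}[f(x) \mid x_1, \ldots, x_{i-1}, x_{i+1}, \ldots, x_N]],$$

where $x_1, \ldots, x_N$ are uniformly distributed. Equivalently, $\mathrm{Inf}_i(f) = \sum_{x : x_i \neq 0} \hat{f}^2(\alpha_x)$.

Definition 9. Let $f : [q]^N \to \mathbb{R}$ be a function. The low-level influence of the i'th variable of f is defined by

$$\mathrm{Inf}_i^{\leq t}(f) = \sum_{x : x_i \neq 0, |x| \leq t} \hat{f}^2(\alpha_x),$$

where $|x|$ denotes the number of non-zero coordinates of x.

Observation 2. For any function f, $\sum_i \mathrm{Inf}_i^{\leq t}(f) = \sum_{x : |x| \leq t} |x| \hat{f}^2(\alpha_x) \leq t \sum_x \hat{f}^2(\alpha_x) = t \|f\|_2^2$. If
$f : [q]^N \to [0, 1]$, then $\|f\|_2^2 \leq 1$, so $\sum_i \mathrm{Inf}_i^{\leq t}(f) \leq t$.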

Definition 10 (Noise stability). Let f be a function from $[q]^N$ to $\mathbb{R}$, and let $-1 \leq \rho \leq 1$. Define the noise
stability of f at $\rho$ as

$$S_\rho(f) = \langle f, T_\rho^{\otimes N} f \rangle = \sum_x \rho^{|x|} \hat{f}^2(\alpha_x),$$

where $T_\rho$ is the Beckner operator as in Definition 5.

A natural way to think about a q-coloring function is as a collection of q indicator variables summing to
1 at every point. To make this formal:

Definition 11. Define the unit q-simplex as $\Delta_q = \{(x_1, \ldots, x_q) \in \mathbb{R}^q \mid \sum_i x_i = 1, x_i \geq 0\}$.

Observation 3. For positive integers Q, q and any function $f = (f_1, \ldots, f_q) : [Q]^N \to \Delta_q$, $\sum_i \mathrm{Inf}_i^{\leq t}(f) =
\sum_i \sum_j \mathrm{Inf}_i^{\leq t}(f_j) \leq t \sum_j \|f_j\|_2^2 \leq t$.

We want to prove a lower bound on the stability of q-ary functions with noise operators T. The following
proposition is a generalization of Proposition 11.4 in [7] to general symmetric Markov operators T with small
spectral radii. Its proof appears in Appendix B.

Proposition 1. For integers $Q, q \geq 3$, and a symmetric Markov operator T on [Q] with spectral radius
$\rho(T) \leq \frac{c}{q-1}$ for some $c > 0$, there is a small enough $\delta = \delta(q) > 0$ and $t = t(q) > 0$ such that any
function $f = (f_1, \ldots, f_q) : [Q]^N \to \Delta_q$ with $\mathrm{Inf}_i^{\leq t}(f) \leq \delta$ for all i satisfies

$$\sum_{j=1}^{q} \langle f_j, T^{\otimes N} f_j \rangle \geq 1/q - 2c \ln q/q^2 - C \ln\ln q/q^2$$

for some universal constant $C < \infty$.

Definition 12 (Moving between domains). For any $x = (x_1, \ldots, x_{2N}) \in [q]^{2N}$, denote $\bar{x} \in [q^2]^N$ as

$$\bar{x} = ((x_1, x_2), \ldots, (x_{2N-1}, x_{2N})).$$

Similarly for $y = (y_1, \ldots, y_N) \in [q^2]^N$, denote $\underline{y} \in [q]^{2N}$ as

$$\underline{y} = (y_{1,1}, y_{1,2}, \ldots, y_{N,1}, y_{N,2}),$$

where $y_i = y_{i,1} + y_{i,2} q$ such that $y_{i,1}, y_{i,2} \in [q]$. For a function f on $[q]^{2N}$, define $\bar{f}$ on $[q^2]^N$ as $\bar{f}(y) = f(\underline{y})$.
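The two maps of Definition 12 are mutually inverse, which a short sketch makes explicit. The names `fold` and `unfold` are hypothetical labels for $x \mapsto \bar{x}$ and $y \mapsto \underline{y}$; pairs $(a, b)$ are encoded as $a + qb$, following the identification of $[M^2]$ with $[M] \times [M]$ from Section 3.2.

```python
def fold(x, q):
    """Map x in [q]^(2N) to x-bar in [q^2]^N, encoding (a, b) as a + q*b."""
    if len(x) % 2 != 0:
        raise ValueError("x must have even length")
    return [x[2 * i] + q * x[2 * i + 1] for i in range(len(x) // 2)]


def unfold(y, q):
    """Map y in [q^2]^N to y-underline in [q]^(2N), splitting y_i = y_i1 + q*y_i2."""
    out = []
    for y_i in y:
        out.extend([y_i % q, y_i // q])
    return out


def lift_function(f, q):
    """Given f on [q]^(2N), return f-bar on [q^2]^N with f-bar(y) = f(unfold(y))."""
    return lambda y: f(unfold(y, q))
```

For example, with $q = 4$ the vector $(1, 2, 0, 3)$ folds to $(9, 12)$ and unfolds back to itself.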
The relationship between influences of variables for functions f and $\bar{f}$ is given by the following claim
(Claim 2.7 in [2]).

Claim 1. For any function $f : [q]^{2N} \to \mathbb{R}$, $i \in \{1, \ldots, N\}$ and any $t \geq 1$, $\mathrm{Inf}_i^{\leq t}(\bar{f}) \leq \mathrm{Inf}_{2i-1}^{\leq 2t}(f) +
\mathrm{Inf}_{2i}^{\leq 2t}(f)$.

3.4 PCP Verifier for Max k-Colorable Subgraph

This verifier uses ideas similar to the Max k-Cut verifier given in [7] and the 4-coloring hardness reduction
in [2]. Let $\mathcal{L} = (U, V, E, R, 2R, \Pi)$ be a 2-to-1 bipartite, unweighted and left-regular Label-Cover instance
as in Conjecture 1. Assume the proof is given as the Long Code over $[k]^{2R}$ of the label of every vertex $v \in V$.
Below, for a permutation $\sigma$ on $\{1, \ldots, n\}$ and a vector $x \in \mathbb{R}^n$, $x \circ \sigma$ denotes $(x_{\sigma(1)}, x_{\sigma(2)}, \ldots, x_{\sigma(n)})$. For
a function f on $\mathbb{R}^n$, $f \circ \sigma$ is defined as $f \circ \sigma(x) = f(x \circ \sigma)$.

• Pick u uniformly at random from U, $u \sim U$.

• Pick v, v' uniformly at random from u's neighbors. Let $\pi, \pi'$ be the associated projection functions, and
$\chi_v, \chi_{v'}$ be the (supposed) Long Codes for the labels of v, v' respectively.

• Let T be the Markov operator on $[k]^2$ given in Lemma 7. Pick $x \sim [k^2]^R$ and $y \sim T^{\otimes R} x$. Let $\sigma_v, \sigma_{v'}$
be two permutations of $\{1, \ldots, 2R\}$ such that $\pi(\sigma_v^{-1}(2i - 1)) = \pi(\sigma_v^{-1}(2i)) = \pi'(\sigma_{v'}^{-1}(2i - 1)) =
\pi'(\sigma_{v'}^{-1}(2i))$ (both $\pi$ and $\pi'$ are exactly 2-to-1, so such permutations exist).

• Accept iff $\chi_v \circ \sigma_v(\underline{x})$ and $\chi_{v'} \circ \sigma_{v'}(\underline{y})$ are different.
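The permutations required in the third step can be built directly from an exactly 2-to-1 projection. The sketch below shows one possible choice, in which the common value in the displayed condition is taken to be i itself. This choice, the function name, and the dictionary representation are assumptions made for illustration, not details confirmed by the text.

```python
def aligning_permutation(projection, R):
    """Build sigma on {1, ..., 2R} from an exactly 2-to-1 projection.

    projection maps {1, ..., 2R} to {1, ..., R}. The two preimages of
    label i are sent to positions 2i - 1 and 2i, so that
    projection[sigma^{-1}(2i - 1)] == projection[sigma^{-1}(2i)] == i.
    """
    sigma = {}
    used = {}
    for a in sorted(projection):
        i = projection[a]
        slot = used.get(i, 0)
        if slot >= 2:
            raise ValueError("projection is not exactly 2-to-1")
        sigma[a] = 2 * i - 1 + slot
        used[i] = slot + 1
    if len(sigma) != 2 * R or any(used.get(i, 0) != 2 for i in range(1, R + 1)):
        raise ValueError("projection is not exactly 2-to-1")
    return sigma


# Hypothetical projection with R = 2.
pi = {1: 2, 2: 1, 3: 2, 4: 1}
sigma = aligning_permutation(pi, 2)
inverse = {position: a for a, position in sigma.items()}
```

Applying the same construction to $\pi'$ gives a second permutation whose position pairs $(2i - 1, 2i)$ also project to i, so both permutations satisfy the displayed condition simultaneously.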
The proofs of the following two lemmas appear in Appendix C.

Lemma 8 (Completeness). If the original 2-to-1 Label-Cover instance $\mathcal{L}$ has a labeling which satisfies all
constraints, then there is a proof which makes the above verifier always accept.

Lemma 9 (Soundness). There is a constant C such that, if the above verifier passes with probability exceeding
$1 - 1/k + O(\ln k/k^2)$, then there is a labeling of $\mathcal{L}$ which satisfies a $\gamma' = \gamma'(k)$ fraction of the constraints
independent of label set size R.

Note that our PCP verifier makes "k-coloring" tests. By the standard conversion from PCP verifiers
to CSP hardness, and Remark 2 about conversion to unweighted graphs with the same inapproximability
factor, we conclude the main result of this section by combining Lemmas 8 and 9.

Theorem 10. For any constant $k \geq 3$, assuming the 2-to-1 Conjecture, it is NP-hard to approximate Max
k-Colorable Subgraph within a factor of $1 - 1/k + O(\ln k/k^2)$.
A Proofs from Section 2.1
A.1 Proof of Lemma 2

Proof. We define the coloring $\chi : V(G) \to [3]$ in the obvious way, with nodes T, R and F fixed to different
colors. Then define

$$\chi(x_i) = \begin{cases} \chi(T) & \text{if } \sigma(x_i) = 1, \\ \chi(F) & \text{else,} \end{cases}$$

and similarly for the nodes $y_j, z_l$. Define

$$\chi(\bar{y}_j) = \begin{cases} \chi(F) & \text{if } \sigma(y_j) = 1, \\ \chi(T) & \text{else.} \end{cases}$$
X{T) else. 
Now, for the constraints satisfied by this assignment, $(x_i \vee (Y_j = z_k)) \wedge (\bar{x}_i \vee (Y_j = z_l))$, consider
the corresponding gadget. Let $\mathrm{Sugg}(A) = [3] \setminus \{\chi(x_i), \chi(T)\}$ and $\mathrm{Sugg}(B) = [3] \setminus \{\chi(Y_j), \chi(z_k)\}$ be the
available colors to A and B which can properly color all edges incident to variables. Notice that none of
these sets are empty and since $x_i \vee (Y_j = z_k)$ is true, at least one of these sets Sugg(A) and Sugg(B) has
two elements in it. Hence there exists a coloring of A and B from sets Sugg(A) and Sugg(B) such that
$\chi(A) \neq \chi(B)$. The same argument also holds for A' and B', therefore all edges in this gadget are properly
colored.

For the violated constraints, either Sugg(A) or Sugg(A') has one element. Augmenting that set with the
color $\chi(x_i)$ will cause only one edge to be violated. □

A.2 Proof of Lemma |3] 

Proof. Since $r < m/2$, the coloring $\chi$ must give three different colors to the nodes $T$, $F$, and $R$. If
$\chi(x_i) = \chi(R)$, then randomly choosing $\chi(x_i)$ from $\{\chi(T), \chi(F)\}$ will, in expectation, make at most half
of the local gadget edges going out of $x_i$ improperly colored, which is exactly the value $\Delta(x_i)/2$ gained.
So we can assume that $\chi(x_i) \in \{\chi(T), \chi(F)\}$ for each $x_i$. A similar argument holds for the nodes $z_l$. Now
consider the nodes $y_j$ and $\bar{y}_j$ for a variable in $Y$. If $\chi(y_j) = \chi(R)$, $\chi(\bar{y}_j) = \chi(R)$ or $\chi(y_j) = \chi(\bar{y}_j)$, then
randomly choosing $(\chi(y_j), \chi(\bar{y}_j))$ from $\{(\chi(T), \chi(F)), (\chi(F), \chi(T))\}$ will, in expectation, make at most
half of the local gadget edges going out of nodes $y_j$ and $\bar{y}_j$ improperly colored, which is exactly the value
$w_j$ gained.

To summarize, we can assume that nodes $T$, $F$ and $R$ are colored differently, $\chi(x_i), \chi(z_l) \in
\{\chi(T), \chi(F)\}$ and $\chi(y_j) \ne \chi(\bar{y}_j)$. Thus all edges other than the edges inside the local gadgets are properly
colored by $\chi$, and by assumption at most $r$ edges are miscolored by $\chi$.

Now define the natural assignment $\sigma'$ that assigns a variable of V the value 1 if the associated variable
received the color $\chi(T)$, and the value 0 if its color is $\chi(F)$.

Consider a local gadget, with all edges properly colored, corresponding to the constraint $(x_i \vee (Y_j =
z_k)) \wedge (\bar{x}_i \vee (Y_j = z_l))$. Assume $\sigma'(x_i) = 0$, which implies $\chi(A) = \chi(R)$. Then both neighbors of $B$
besides $A$ must have the same color, therefore $\sigma'(Y_j) = \sigma'(z_k)$. The other case when $\sigma'(x_i) = 1$ is similar.
Hence the assignment $\sigma'$ will satisfy this constraint.

Since the local gadgets corresponding to different constraints have disjoint sets of edges, it follows that 
the number of constraints violated by the assignment a' is at most r. □ 

B Proof of Proposition [H 

Proof. Let $t = 4$, let $f_i : [Q]^n \to [0, 1]$ denote the $i$-th coordinate function of $f$, and let $\mu_i = \mathbb{E}[f_i]$. Let
$\alpha_0, \ldots, \alpha_{Q-1}$ be an orthonormal set of eigenvectors for $T$ with corresponding eigenvalues $\lambda_0 \ge \cdots \ge \lambda_{Q-1}$,
with $\rho = \rho(T)$ being the spectral radius of $T$, bounded as in the statement of the proposition. Notice that $T$ is symmetric so $\lambda_0 = 1$ and $\alpha_0$ is a
constant vector. Therefore $\mathbb{E}[f_i] = \hat{f}_i(\alpha_0) = \mu_i$. Then (using the notation from Q):

and hence 
At this point, consider the Beckner operator $T_\rho$ on $[Q]$. Since $\alpha_0$ is the uniform distribution, it is a
constant vector, thus $\alpha_0, \alpha_1, \ldots, \alpha_{Q-1}$ is also an orthonormal basis for $T_\rho$. Consequently,



Thus 



f^-pl^'l if|x|y^O, 

I = 1 else. 

X x:\x\!^A a;:|a;|>4 

x:\x\^A 

^ 2f^^- p^^^mc^x)-q-' 



a;:|a;|^4 



At this point, let fi{x) = J2x-\x\^4(n.a^o ^^a^'')fi{'^x)ctx be the function having the same low-level coef- 
ficients with fi{x) and for the higher-levels. It is easy to verify that E /j = /ij, \nU{fj) ^ lnfj(/j) = 

Inff (/,)and§p(/,) = E..;|..|<4 In particular, our assumption Inff (/,) = Ej '"ff (/j) ^ 
6 implies J2j ^ ^■ 

Let J be a small enough constant such that ^ r_£_(/xi) + e for some small e ^ ^, from 

the Majority is Stablest Theorem HI. In Q, $\Lambda_\rho(\mu)$ is used for $\Gamma_\rho(\mu)$ and we will follow that convention
instead. Below, for a real $x$, $[x]^+$ denotes $\max\{x, 0\}$. Then



i i 
i 



2q 



> - 



1 2c\nq / In In g \ 



O 



The last inequality is proved in the same way as Proposition 11.4 in Q. The only difference is that we have

= + ^2Mf In(lM) . (l + C7i^) 



and 



i=l 



i=l 
which is convex because $\mu_i \ge 0$ and minimized at $\mu_i = 1/q$. In this case, we have



Y,{2^l} - Fijii)) ^ q - 2cq~Hnq{l + C Inln q / In q) 



i=l 

from which the above claim follows. 



□ 



C Analysis of PCP verifier for Max k-Colorable Subgraph
C.1 Proof of Lemma 8

Proof. Let $\ell : V \to \{1, \ldots, 2R\}$ be a labeling for $\mathcal{L}$ satisfying all constraints in H. Pick $\chi_v$ as the Long
Code encoding of $\ell(v)$. Given any pair of vertices $v, v' \in V$ which share a common neighbor $u \in U$, and
pairs $x, y \in [k]^{2R}$ such that



Pr [y 



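let $\pi, \pi'$ be the projection functions and $\sigma_v, \sigma_{v'}$ be the permutations as defined in the description of the
verifier. We have $\chi_v(x \circ \sigma_v) = x_{\sigma_v(\ell(v))}$ and $\chi_{v'}(y \circ \sigma_{v'}) = y_{\sigma_{v'}(\ell(v'))}$. Since $\pi(\ell(v)) = \pi'(\ell(v'))$, this
implies $\sigma_v(\ell(v)), \sigma_{v'}(\ell(v')) \in \{2i - 1, 2i\}$ for some $i \le R$. But

\[ T\big((x_{2i-1}, x_{2i}) \leftrightarrow (y_{2i-1}, y_{2i})\big) > 0 \implies \{x_{2i-1}, x_{2i}\} \cap \{y_{2i-1}, y_{2i}\} = \emptyset, \]

therefore $\chi_v \circ \sigma_v(x) = x_{\sigma_v(\ell(v))} \ne y_{\sigma_{v'}(\ell(v'))} = \chi_{v'} \circ \sigma_{v'}(y)$. So the verifier always accepts. □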
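
The completeness argument can be illustrated with a small Python sketch. It is a simplification under stated assumptions, not the verifier of the paper: the permutations are taken to be the identity, coordinates and colors are 0-indexed (block $i$ consists of coordinates $2i$ and $2i+1$), and the sampler only reproduces the support property used in the proof (the $i$-th pairs of $x$ and $y$ share no color), not the actual distribution $T$. The names `sample_disjoint_blocks`, `dictator` and `accepts` are hypothetical.

```python
import random


def sample_disjoint_blocks(k, R, rng):
    """Return x, y in [k]^{2R} whose i-th pairs share no color (needs k >= 3)."""
    x, y = [], []
    for _ in range(R):
        a, b = rng.randrange(k), rng.randrange(k)
        # colors still allowed for the matching pair of y
        free = [c for c in range(k) if c not in (a, b)]
        x += [a, b]
        y += [rng.choice(free), rng.choice(free)]
    return x, y


def dictator(label):
    """Long Code (dictator) encoding of `label`: read that coordinate."""
    return lambda z: z[label]


def accepts(x, y, label_v, label_w):
    """The check passes when the two queried colors differ."""
    return dictator(label_v)(x) != dictator(label_w)(y)
```

Whenever the two labels fall in the same block, which is what consistent projections guarantee, the two queried colors come from disjoint pairs and the check passes.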

C.2 Proof of Lemma 9

Proof. For each node $v \in V$, let $f^v$ be the function on $[k]^{2R}$ defined by $f^v(x) = e_{\chi_v(x)}$, where $e_i$ is the indicator
vector of the $i$-th coordinate. Let $\Gamma(u)$ denote the set of vertices adjacent to $u$ in the Label Cover graph.
After arithmetizing, we can write the verifier's acceptance probability as



Pr [acc] 



1-E„ 
1-E„ 
1 - E,, 



^ 1-1/k + Clnk/k'^ 



'D~r(u) 



/Jo a.]) 



where : [k"^]^ and some constant C. By averaging, for at least a fraction 6 = {e/2) hi k/k"^ of 

vertices in U, we have 

3 

Let these be "good" vertices. For a good vertex, by Proposition [T] there exist constants 5 = 5{k), t = t{k) 
and i such that lnff*(c/") ^ 6. Let Sugg„ = € {1,...,R} A lnff*(c/") ^ 6}, so |Sugg„[ ^ 1. By 
Observation [3l |Sugg„| For a good vertex u, and j G Sugg„: 



E. 



't;~r(u) 
Therefore, for at least a fraction $\delta/2$ of neighbors $v$ of $u$, $\mathrm{Inf}_j^{\le t}(f^v \circ \sigma_v) \ge \delta/2$. For such $v$ and $j$, by
ClaimE, $\mathrm{Inf}_{2j-1}^{\le 2t}(f^v \circ \sigma_v) + \mathrm{Inf}_{2j}^{\le 2t}(f^v \circ \sigma_v) \ge \delta/2$. Therefore for some $j \in [2R]$, $\mathrm{Inf}_j^{\le 2t}(f^v) \ge \delta/4$. Let

$\mathrm{Sugg}_v = \{ j \in \{1, \ldots, 2R\} \wedge \mathrm{Inf}_j^{\le 2t}(f^v) \ge \delta/4 \}$. Again, $\mathrm{Sugg}_v$ is not empty and $|\mathrm{Sugg}_v| \le 8t/\delta$.

Following the decoding procedure in [7], we deduce that it is possible to satisfy a fraction $\gamma' = \gamma'(\delta, t) =
\gamma'(k)$ of the constraints. □



D Handling k not divisible by 3 in Theorem 4

We now argue how to handle the case when $k \bmod 3 \ne 0$ in the statement of Theorem 4. Assume $k$ is
of the form $K + L$, where $K \equiv 0 \pmod 3$ and $L = k \bmod 3 \in \{1, 2\}$. We will give a reduction from
Max $K$-Colorable Subgraph, which we already showed to be NP-hard to approximate within a factor
$1 - \frac{1}{33K} + \varepsilon$, to Max $k$-Colorable Subgraph.

Let $G_K$ be an (unweighted) instance of Max $K$-Colorable Subgraph with $M$ edges. Construct a graph
$H$ by adding $L$ new vertices $u_1, \ldots, u_L$ to $G_K$. Each $u_i$ is connected by an edge of weight $d_v/K$ to each vertex
$v \in V(G_K)$, where $d_v$ is the degree of $v$ in $G_K$. If $L > 1$, $(u_1, u_2)$ is an edge in $H$ with weight $\frac{M}{33K}$. The
total weight of edges in $H$ equals

\[ M' = M + \frac{2LM}{K} + \frac{M(L-1)}{33K}. \]

Clearly if $G_K$ is $K$-colorable, then $H$ is $k$-colorable. For the soundness part, suppose every $K$-coloring
of $G_K$ miscolors at least $\left(\frac{1}{33K} - \varepsilon\right) M$ edges. Let $\chi$ be an optimal $k$-coloring of $H$. We will prove that
$\chi$ miscolors edges with total weight at least $M\left(\frac{1}{33K} - \varepsilon\right)$. This will certainly be the case if $L > 1$ and
$\chi(u_1) = \chi(u_2)$. So we can assume $\chi$ uses $L$ colors for the newly added vertices $u_i$. If $\chi(v) = \chi(u_i)$ for
some $v \in V(G_K)$, we can change $\chi(v)$ to one of the $K$ colors not used to color $\{u_1, \ldots, u_L\}$ so that the
weight of miscolored edges does not increase: some such color is shared by at most $d_v/K$ of the neighbors of $v$
in $G_K$, which is exactly the weight of the edge $(u_i, v)$ that becomes properly colored. Therefore, we can assume that $\chi$ uses only $K$ colors to color
the $G_K$ portion of $H$. But this implies at least $M\left(\frac{1}{33K} - \varepsilon\right)$ edges are miscolored by $\chi$, as desired.
Thus every $k$-coloring of $H$ miscolors at least a fraction

\[ \frac{M\left(\frac{1}{33K} - \varepsilon\right)}{M'} = \frac{\frac{1}{33K} - \varepsilon}{1 + 2L/K + (L-1)/(33K)} \ge \frac{1}{33(k + L) + (L - 1)} - \varepsilon \]

of the total weight of edges in $H$. Since $L = k \bmod 3$, the bound stated in Theorem 4 holds.
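
The arithmetic of this reduction can be checked with exact rationals. The Python sketch below is illustrative only: the function names `total_weight_of_h`, `closed_form` and `soundness_fraction` are hypothetical, the edge weights $d_v/K$ and $M/(33K)$ are the ones consistent with the total-weight expression above, and the last function evaluates the miscolored fraction with $\varepsilon = 0$.

```python
from fractions import Fraction


def total_weight_of_h(edges, K, L):
    """Total edge weight of H built from an unweighted edge list of G_K."""
    M = len(edges)
    degree = {}
    for u, v in edges:
        degree[u] = degree.get(u, 0) + 1
        degree[v] = degree.get(v, 0) + 1
    total = Fraction(M)  # original edges, weight 1 each
    # each of the L new vertices joins every v with weight d_v / K
    total += L * sum(Fraction(d, K) for d in degree.values())
    if L > 1:
        total += Fraction(M, 33 * K)  # the edge (u_1, u_2)
    return total


def closed_form(M, K, L):
    """M' = M + 2LM/K + M(L-1)/(33K)."""
    return M + Fraction(2 * L * M, K) + Fraction(M * (L - 1), 33 * K)


def soundness_fraction(K, L):
    """(1/(33K)) / (1 + 2L/K + (L-1)/(33K)), i.e. the bound with epsilon = 0."""
    return Fraction(1, 33 * K) / (1 + Fraction(2 * L, K) + Fraction(L - 1, 33 * K))
```

With $k = K + L$, the last quantity simplifies to $1/(33(k + L) + (L - 1))$, since $33K + 66L = 33(k + L)$.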